AITopics | convergence behavior

Mean Field for the Stochastic Blockmodel: Optimization Landscape and Convergence Issues

Neural Information Processing Systems, Mar-17-2026, 01:37:08 GMT

Variational approximation has been widely used in large-scale Bayesian inference recently, the simplest kind of which involves imposing a mean field assumption to approximate complicated latent structures. Despite the computational scalability of mean field, theoretical studies of its loss function surface and the convergence behavior of iterative updates for optimizing the loss are far from complete. In this paper, we focus on the problem of community detection for a simple two-class Stochastic Blockmodel (SBM). Using batch co-ordinate ascent (BCAVI) for updates, we give a complete characterization of all the critical points and show different convergence behaviors with respect to initializations. When the parameters are known, we show a significant proportion of random initializations will converge to ground truth. On the other hand, when the parameters themselves need to be estimated, a random initialization will converge to an uninformative local optimum.

artificial intelligence, name change, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.41)

New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity

Pan Zhou, Xiaotong Yuan, Jiashi Feng

Neural Information Processing Systems, Feb-19-2026, 19:34:09 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, hsgd, ifo complexity, (12 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Singapore > Central Region > Singapore (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Mean Field for the Stochastic Blockmodel: Optimization Landscape and Convergence Issues

Neural Information Processing Systems, Nov-20-2025, 23:06:27 GMT

name change, optimization landscape and convergence issue, stochastic blockmodel, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.41)

New Insight into Hybrid Stochastic Gradient Descent: Beyond With-Replacement Sampling and Convexity

Pan Zhou, Xiaotong Yuan, Jiashi Feng

Neural Information Processing Systems, Nov-20-2025, 17:03:14 GMT

As an incremental-gradient algorithm, hybrid stochastic gradient descent (HSGD) enjoys the merits of both stochastic and full gradient methods for finite-sum problem optimization.

artificial intelligence, hsgd, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Singapore > Central Region > Singapore (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

1da546f25222c1ee710cf7e2f7a3ff0c-Supplemental.pdf

Neural Information Processing Systems, Nov-20-2025, 08:30:13 GMT

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Genre: Workflow (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Benchmarking VQE Configurations: Architectures, Initializations, and Optimizers for Silicon Ground State Energy

Zakaria Boutakka, Nouhaila Innan, Muhammed Shafique, Mohamed Bennai, Z. Sakhi

arXiv.org Artificial Intelligence, Oct-28-2025

Quantum computing presents a promising path toward precise quantum chemical simulations, particularly for systems that challenge classical methods. This work investigates the performance of the Variational Quantum Eigensolver (VQE) in estimating the ground-state energy of the silicon atom, a relatively heavy element that poses significant computational complexity. Within a hybrid quantum-classical optimization framework, we implement VQE using a range of ansatz, including Double Excitation Gates, ParticleConservingU2, UCCSD, and k-UpCCGSD, combined with various optimizers such as gradient descent, SPSA, and ADAM. The main contribution of this work lies in a systematic methodological exploration of how these configuration choices interact to influence VQE performance, establishing a structured benchmark for selecting optimal settings in quantum chemical simulations. Key findings show that parameter initialization plays a decisive role in the algorithm's stability, and that the combination of a chemically inspired ansatz with adaptive optimization yields superior convergence and precision compared to conventional approaches.

artificial intelligence, initialization, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2510.23171

Country:

North America > United States (0.28)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
Energy (0.46)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)

1 Analytical Derivation of the Convergence Behavior in the Case Study

Neural Information Processing Systems, Oct-9-2025, 01:59:50 GMT

In this section, we briefly introduce the physical background of Equation (8) in Section 3.5 of the

artificial intelligence, equation, main paper, (16 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence (0.48)

Appendices for "Finding Second-Order Stationary Points Efficiently in Smooth Nonconvex Linearly Constrained Optimization Problems": A Details of Implementation of Algorithms

Neural Information Processing Systems, Oct-2-2025, 09:10:54 GMT

In this section, we elaborate on the ideas behind the design of SNAP. First, we give the main motivation for selecting the update directions. Next, we give a detailed description of the line search algorithm used in SNAP. A.2 Line Search Algorithm To understand the algorithm, let us first define the set of inactive constraints as A Lemma 2. If there exists an index i A (x Therefore, the line search algorithm reduces to the classic unconstrained update. If so, then the algorithm either touches the boundary without increasing the objective, or it has already achieved sufficient descent.

algorithm, artificial intelligence, optimization problem, (16 more...)

Neural Information Processing Systems

Genre: Workflow (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

f3f27a324736617f20abbf2ffd806f6d-Supplemental.pdf

Neural Information Processing Systems, Aug-17-2025, 06:41:00 GMT

artificial intelligence, machine learning, sup 0, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > Singapore (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning

Pan Zhou

Neural Information Processing Systems, Aug-17-2025, 06:40:52 GMT

In this work, we provide a new viewpoint for understanding the generalization performance gap.

artificial intelligence, deep learning, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
